Emergent collective behaviors in a multi-agent reinforcement learning based pedestrian simulation
نویسندگان
چکیده
In this work, a Multi-agent Reinforcement Learning framework is used to get plausible simulations of pedestrians groups. In our framework, each virtual agent learns individually and independently to control its velocity inside a virtual environment. The case of study consists on the simulation of the crossing of two groups of embodied virtual agents inside a narrow corridor. This scenario permits us to test if a collective behavior, specifically the lanes formation is produced in our study as occurred in corridorswith real pedestrians. The paper studies the influence of different learning algorithms, function approximation approaches, and knowledge transfer mechanisms in the performance of the learned pedestrian behaviors. Specifically, two different RL-based schemas are analyzed. The first one, Iterative Vector Quantization with Q-Learning (ITVQQL) improves iteratively a state-space generalizer based on vector quantization. The second scheme, named TS, uses Tile coding as the generalization method with the Sarsa(λ) algorithm. Knowledge transfer approach is based on the use of Probabilistic Policy Reuse to incorporate previously acquired knowledge in current learning processes; additionally, value function transfer is also used in the ITVQQL schema to transfer the value function between consecutive iterations. The results demonstrate empirically that our RL framework generates individual behaviors capable of emerging the expected collective behavior as occurred in real pedestrians. This collective behavior appears independently of the generalization method used, but depends extremely on whether knowledge transfer was applied or not. In addition, the use of transfer techniques has a notable influence in the final performance (measured in number of times that the task was solved) of the learned behaviors. A video of the simulation is available at the URL: http://www.uv.es/agentes/RL/index.htm
منابع مشابه
MARL-Ped: A multi-agent reinforcement learning based framework to simulate pedestrian groups
Pedestrian simulation is complex because there are different levels of behavior modeling. At the lowest level, local interactions between agents occur; at the middle level, strategic and tactical behaviors appear like overtakings or route choices; and at the highest level path-planning is necessary. The agent-based pedestrian simulators either focus on a specific level (mainly in the lower one)...
متن کاملMulti-agent Reinforcement Learning for Simulating Pedestrian Navigation
In this paper we introduce a Multi-agent system that uses Reinforcement Learning (RL) techniques to learn local navigational behaviors to simulate virtual pedestrian groups. The aim of the paper is to study empirically the validity of RL to learn agent-based navigation controllers and their transfer capabilities when they are used in simulation environments with a higher number of agents than i...
متن کاملDetection of Primitive Collective Behaviours in a Crowd Panic Simulation Based on Multi-Agent Approach
We propose an approach towards multi-agent system for simulation and detection of primitive collective behaviors emerging from a crowd in panic. This paper presents various works on which our method is based, by methods of planning and decisions allowing emergence of primitive collective behaviors. We present then an implementation in a virtual environment and detection experiments of emergent ...
متن کاملCollective Robots Navigation by Reinforcement Learning Mechanisms with Common Knowledge Field ––an Approach for Heterogeneous-agents Systems––
In this study, we propose a new approach to realize a reinforcement learning scheme for heterogeneous multiagent systems. In our approach, we treat the collective agents systems in which there are multiple autonomous mobile robots, and given tasks are achieved based on the collective behavior approach. Also, each agent organizes and refines its knowledge for executing its own behaviors by reinf...
متن کاملMultiagent Supervised Training with Agent Hierarchies and Manual Behavior Decomposition
We present a supervised learning from demonstration system capable of training stateful and recurrent behaviors, both in the single agent and multiagent case. Furthermore, behavior complexity due to statefulness and multiple agents can result in a high dimensional learning space, which can require many samples to learn properly. Our approach, which relies heavily on both per-agent behavior deco...
متن کامل